-
Notifications
You must be signed in to change notification settings - Fork 3.5k
fix: ollama detect context length automatically #7702
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
RomneyDa
merged 5 commits into
continuedev:main
from
uinstinct:ollama-memory-contextlength
Sep 17, 2025
Merged
fix: ollama detect context length automatically #7702
RomneyDa
merged 5 commits into
continuedev:main
from
uinstinct:ollama-memory-contextlength
Sep 17, 2025
+6
−0
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
4096 might be a bit too restrictive, wondering if we should do 8192 |
I think 20% is fine, 4096 makes sense for text in text out but for an agent application it's too restrictive |
RomneyDa
approved these changes
Sep 17, 2025
🎉 This PR is included in version 1.19.0 🎉 The release is available on: Your semantic-release bot 📦🚀 |
🎉 This PR is included in version 1.16.0 🎉 The release is available on: Your semantic-release bot 📦🚀 |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Use the Ollama configured default context length of 4096 (instead of Continue's 32,768).
resolves CON-3817
AI Code Review
@continue-general-review
or@continue-detailed-review
Checklist
Screen recording or screenshot
before
after
Tests
[ What tests were added or updated to ensure the changes work as expected? ]
Summary by cubic
Use Ollama’s actual context length: we now default to 4096 (Ollama’s config) and use the model-provided value when available, instead of Continue’s 32,768. This prevents overestimating the token window and aligns with CON-3817.